PaTre: A Method for Paralogy Trees Construction
نویسندگان
چکیده
Genomes can be described as a collection of clusters, the gene families, whose members are called paralogs. Paralogs are genes that most probably share duplication history and show a significant similarity in their sequences, even if they perform slightly different biological function. Among the different mechanisms that have led to an increase of the genomic information during biological evolution, gene duplication is probably the most important. To better understand duplication events, the first step is to investigate the history of the gene families in order to detect which duplication events have taken place, and in which relative (partial) order. Here we present a method, called PaTre, that, given a gene family, attempts to construct the paralogy tree of the family. We will work under the hypothesis that every family member derives from a duplication process of another member. By the term paralogy tree, we mean a directed tree in which the root represents the most ancient paralog of the family and each oriented arc (a, b) represents the existence of a duplication event from the template gene a to its copy b. Notice that gene a survives the event and can serve as a template of more than one duplication event; in fact, there can be more than one arc leaving a. PaTre uses new algorithmic techniques motivated by the specific application at hand. The reliability of the inferential process has been tested by means of a simulator that implements different hypotheses on the duplication-with-modification paradigm and on three examples of different biological gene families, belonging either to lower and higher organisms.
منابع مشابه
MetaPhOrs: orthology and paralogy predictions from multiple phylogenetic evidence using a consistency-based confidence score
Reliable prediction of orthology is central to comparative genomics. Approaches based on phylogenetic analyses closely resemble the original definition of orthology and paralogy and are known to be highly accurate. However, the large computational cost associated to these analyses is a limiting factor that often prevents its use at genomic scales. Recently, several projects have addressed the r...
متن کاملGoing nuclear: gene family evolution and vertebrate phylogeny reconciled.
Gene duplications have been common throughout vertebrate evolution, introducing paralogy and so complicating phylogenetic inference from nuclear genes. Reconciled trees are one method capable of dealing with paralogy, using the relationship between a gene phylogeny and the phylogeny of the organisms containing those genes to identify gene duplication events. This allows us to infer phylogenies ...
متن کاملGene Tree Construction and Correction using SuperTree and Reconciliation
The supertree problem asking for a tree displaying a set of consistent input trees has been largely considered for the reconstruction of species trees. Here, we explore this framework for the sake of reconstructing a gene tree from a set of input gene trees on partial data. The phylogenetic tree for the species containing the genes of interest can be used to choose among the many possible compa...
متن کاملPhylomeDB v3.0: an expanding repository of genome-wide collections of trees, alignments and phylogeny-based orthology and paralogy predictions
The growing availability of complete genomic sequences from diverse species has brought about the need to scale up phylogenomic analyses, including the reconstruction of large collections of phylogenetic trees. Here, we present the third version of PhylomeDB (http://phylomeDB.org), a public database for genome-wide collections of gene phylogenies (phylomes). Currently, PhylomeDB is the largest ...
متن کاملTemporal paralogy, cladograms, and the quality of the fossil record
Previous attempts to quantify the adequacy between phylogenetic trees (cladograms with a temporal dimension) and the fossil record failed because of inappropriate statistics. A general explanation for this failure is based on a hierarchical perception of the temporal scale. When time is conceived as a hierarchy and not as an arrow, it can be expressed by a pectinate-shaped tree. Comparison betw...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Journal of computational biology : a journal of computational molecular cell biology
دوره 10 5 شماره
صفحات -
تاریخ انتشار 2003